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AB&..ACT 


'^The  generic  problem  of  selecting  the  sequence  of  sensors  which 
optimizes  the  information  received  about  a  number  of  discrete  hypotheses 
is  considered.  The  optimization  criterion  penalizes  the  uncertainty 
present  about  pairs  of  hypotheses  in  a  form  which  has  an  eigenfunction 
property  with  respect  to  a  Bayes  update  of  the  conditional  probability 
distribution.  Application  of  the  Portryagin  minimum  principle  yields 
elegant  solutions  to  an  interesting  class  of  problems.  Applications  in 
surveillance,  failure  detection,  and  nondestructive  testing  are  possible 
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OPTIMAL  SENSOR  SCHEDULING  FOR  MULTIPLE  HYPOTHESIS  TESTING 

I.  Introduction 

The  problem:  Often  several  competing  hypotheses  exist  about  the  state 
of  a  particular  entity,  and  real  time  observations  must  be  used  to  discriminate 
between  them.  Once  the  set  of  sensors  to  be  used  has  been  specified,  the 
observations  can  be  used  to  update  prior  information  in  a  number  of  ways, 
although  Bayes'  theorem  underlies  some  of  the  most  common  techniques  [1] . 

In  this  framework,  the  net  information  is  captured  in  the  posterior  probability 
distribution  over  the  hypotheses. 

In  cases  where  several  sensors  are  available  but  are  mutually  exclusive 
in  their  use  (either  due  to  interference,  or  because  one  physical  sensor  must 
be  pointed  in  one  of  a  number  or  directions) ,  an  additional  problem  arises 
in  determining,  also  in  real  time,  that  sequence  of  sensors  which  should  be 
activated  to  provide  the  above  information.  The  efficacy  of  a  particular 
sensor  sequence  must  be  related  to  the  character  of  the  resulting  posterior 
probabilities,-  these  should  clearly  discriminate  among  the  hypotheses. 

Mathematically,  this  can  be  viewed  as  a  problem  of  selecting,  at  each 
point  in  time,  one  of  M  sensors  to  obtain  information  about  a  set  of  K 
hypotheses.  By  defining  an  interesting  cost  function  on  the  se  of  posterior 
distributions,  we  can  seek  an  optimal  sensor  scheduling  procedure. 

Applications :  A  number  of  generic  application  problems  exhibit  this 
structure.  In  surveillance  problems  [e.g.  2) ,  the  hypotheses  would  present 
the  presence  and  type  of  target  that  exists  at  each  point  in  the  surveillance 


volume,  and  the  sensors  would  be  radars  or  detectors  which  must  be  pointed 
in  azimuth  and  elevation.  In  failure  detection  and  identification  (3] ,  the 
hypotheses  may  represent  different  types  of  failure  and  onset  time  of  the 
failure,  in  nondestructive  fault  localization  [4] ,  the  hypotheses  are 
types  and  locations  of  faults  in  some  medium,  and  the  sensors  report  the 
attenuation  of  energy  as  it  passes  through  the  medium  in  some  direction 
to  be  specified.  Search  problems  [5,6]  fall  into  this  structure  as  the 
hypothetical  locations  of  an  object  are  observed  from  various  vantage 
points. 

Perspective :  A  number  of  authors  have  addressed  scheduling  problems 
in  this  context.  Some  make  special  assumptions  regarding  the  observation 
probability  distributions  [7,8] ,  others  treat  the  more  general  problem 
[9,10].  However,  two  common  threads  connect  these  approaches:  they 
obtain  feedback  laws  mapping  posterior  destributions  into  sensor  selections 
and  they  use  optimization  criteria  based  on  weighted  probabilities  of  error 
(i.e.  Bayes'  risk  criteria).  A  common  result  of  these  two  factors  is  a 
very  ungainly  decision  rule  unless  relatively  restrictive  assumptions  are 
made. 

The  point  of  departure  of  the  solution  found  here  is  precisely  these 
two  characteristics.  This  solution  is  content  with  being  open  loop 
(although  it  is  often  actually  easier  to  use  in  an  open  loop  feedback 
manner) ,  and  is  based  on  a  criterion  which  measures  the  uncertainty  in 
the  posterior  distribution  (and  incidentally  provides  an  upper  bound  on 
weighted  probabilities  of  srror) . 

Overview:  The  strategy  taken  in  the  sequel  is  to  first  pose  the 
problem  in  discrete  time,  with  emphasis  on  the  new  cost  function  and  its 


interpretation.  Then  the  stochastic  problem  is  reduced  to  a  deterministic 
control  problem,  in  continuous  time,  with  a  convex  control  set.  The 
Pontryagin  minimum  principle  can  be  brought  to  bear  on  the  problem,  and 
the  resulting  necessary  conditions  for  the  optimal  schedule  define  a  two 
point  boundary  value  problem  with  sectorwise  linear  dynamics.  The  general 
structure  of  the  solution  is  then  available;  certain  structural  assumptions 
lead  to  an  iterative  solution  for  the  general  case,  and  an  elegant  solution 
for  a  more  restricted  set  of  problems.  An  interesting  side  result  is  a 
geometrical  characterization  of  each  sensor  by  a  vector  of  parameters 
decribing  its  capabilities  to  distinguish  between  various  pairs  of 
hypotheses. 

II.  Problem  Statement. 

Hypotheses:  The  K  hypotheses,  one  of  which  may  be  valid,  are  denoted"* 
Hj^,  k  =  1 , . . . ,K.  Prior  knowledge  provides  a  probability  distribution  at 
time  t  *  0,  denoted  by  IT  (0)  ,  where 

V0)  ■  pCV  a> 

Observations :  Sensor  outputs  obtained  at  time  t  from  sensor  j  are 

wmM 

denoted  (t) .  The  statistics  of  (t)  are  independent  of  everything 
except  the  sensor  j  and  the  underlying  hypothesis  H^;  in  particular 

pfrj'.t)!^,  y^s))  -  p(y.,  (t)|  H^,)  s  jt  t  (2) 

(In  continuous  time,  this  implies  that  (t)  is  a  process  with  independent 
increments  [11] ;  the  continuous  time  case  will  be  addressed  more  fully  in 
section  III) .  However,  no  assumption  of  stationarity  of  these  distributions 

3 - 

Notation  is  reviewed  in  Appendix  A. 
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nee<5  be  made . 
If 


rk(t)  =  pO^lyd) . y  (t) ) 

then  by  Bayes’  law 


(3) 


P  (y  (t+i)  I  h,  )  u  <t) 

IT.  (t+l)  =  - - - - - - - 

p(y(t+l)) 


(4) 


where  subscripts  denoting  sensor  choices  have  been  omitted.  Equations  (1) 
and  (4)  give  the  dynamic  equations  for  the  evolution  of  the  posterior 
distribution  as  observations  are  obtained. 


Cost:  The  objective  of  a  detection  or  identification  algorithm  is  to 
produce  correct  estimates  of  the  true  state  of  a  system.  It  is  also  bene¬ 
ficial  if  these  estimates  come  with  high  confidence  levels.  Thus,  if  one 
is  seeking  to  drive  posterior  distributions  to  some  values,  the  best  values 
are  near  the  extremes,  where  the  true  hypothesis  is  known  with  almost 
certainty. 

Consider  the  binary  hypothesis  case.  Figure  1  shows  three  candidate 
penalty  functions  which  have  reasonable  qualitative  characteristics  - 
they  are  all  minimum  at  the  extremes  and  convex  downward.  Number  1  is  the 
minimum  probability  of  error  incurred  if  a  decision  between  and  H ^  had 
to  be  made.  Number  2  is  a  direct  measure  of  the  uncertainty  in  the 
distribution:  it  is  the  entropy  (scaled  by  1/2) 


'  7  *  \log2irk 

k=l 


(5) 


The  third  is  similar  to  the  second 


vfn^,  tt 2 J  “  it 2 


(6) 


Note  that  the  last  two  indicate  that  an  improvement  in  the  probability  of 


FIGURE  1 

Candidate  Penalty  Functions  on  7r  -  Binary  Case 
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error  from,  fay,  10%  to  1%  is  much  more  rewarding  than  one  from  49%  to  40%, 
and  thus  greatly  encourage  extremal  distributions. 

The  third  form  posesses  unique  analytical  properties,  as  will  be  seen 
in  section  III.  It  can  be  generalized  to  the  form 


V  (it  ^ .  ,  TT^) 


K 

n 

k=l 


(7) 


K 

1  -  *  rk  rk  1  0 

k=l 

and  to  sums  of  terms  of  this  form  without  compromizing  these  properties. 
Definition:  An  obscurity  function  v(tt)  is  of  the  form 

^  N 

v(ir)=Eb.v.  (tt)  (8) 

i=l  1 

with  each  v^Cir)  having  the  form  of  (7). 

The  obscurity  function  measures  the  lack  of  knowledge  abou  the  hypotheses. 
It  is  minimum  when  tt  is  pure,  i.e.  when  all  but  one  component  are  zero. 

The  coefficients  represent  weights  attached  to  varying  types  of  obscurity 
as  indicated  by  the  form  of  the  associated  v^(tt). 

Example:  Consider  the  ternary  hypothesis  testing  problem.  Two  candidate 
obscurity  functions  are 


1 

V*  (IT )  = 

,  1/3 

(TT1  1T2  TTi) 

(9) 

and 

V2  (TT)  = 

(V2)1/2  +  (TTlV1/2+  V1/2 

(10) 

Both  are  zero  for  pure  it ;  both  have  their  maxima  at  u  *  [1/3  1/3  1/3]  . 
However,  the  former  includes  as  minima  all  distributions  with  one  component 
zero,  the  latter  has  only  three  minimum  points  (Figure  2) .  The  former  is 


minimized  when  any  hypothesis  is  eliminated;  the  latter,  when  any  hypothesis  is 


confirmed. 

The  above  definition  is  quite  general;  in  the  sequel  we  will  assume  all 
terms  in  the  obscurity  function  are  of  the  form 


-»  1/2 

v.  (it)  =  (it.  tt.  )  '  (11) 

All  results  will  generalize  to  the  earlier  case,  but  this  makes  clear  that 
each  term  in  v(-jr)  represents  the  degree  to  which  a  pair  of  hypotheses  can 
be  distinguished,  and  thus  a  different  type  of  obscurity. 

The  selection  of  the  obscurity  function  provides  a  great  deal  of  flexibility. 
For  instance,  if  one  is  only  interested  in  determining  whether  or  not  is 
true,  a  function  of  the  form 


v(iri) 


K 

y 

k=2 


[*1  V 


1/2 


(12) 


is  appropriate,  as  it  penalizes  ambiguity  between  and  any  other  hypothesis 
without  including  the  obscurity  between  the  others. 


Special  Cases:  There  are  two  special  cases  of  obscurity  function  of  note. 
Definition;  The  uniform  obscurity  function  is 


u 

v 


(tt) 


"  K  1  1/2 
E  I  it.  )  ' 

k  =1  k2=l  *1  2 


k2  *  k1 


(13) 


The  uniformity  stems  from  the  equal  penalizing  of  all  pairs  of  hypotheses; 
.  u 

useful  interpretation  of  v  is  given  m 
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Theorem  1:  v  (ir)  is  an  upper  bound  to  the  minimum  probability  of  error 
achieved  a  decision  .rule  selecting  an  estimate  of  at  each  point  in 
time,  based  on  the  distribution  tt. 


Proof:  The  minimum  probability  of  error  is  achieved  by  selecting  the 

A. 

which  has  maximum  probability  ir^;  the  resulting  error  probability  is 

A 

1  -  Now 


vU  (^ )  =  I 


K 

Z 


kri  v1 

k2^i 


1  ,  ,1/2 

7  (Trv  ) 

1  2 


K 

z 

k2-l 

A 


(lTk 


1/2 


(14) 


K 

z 

V1 

A 

k2/k 


(1 


V 


The  second  special  case  concerns  a  set  of  hypotheses  which  is  the 
Cartesian  product  of  two  or  more  sets  of  subhypotheses.  If  the  posterior 
distributions  on  the  composite  set  always  factor  into  distributions  on  the 
component  sets,  a  natural  obscurity  function  arises  which  additively  decomposes. 
Let  be  the  composite  hypothesis  (H^  H^) ,  where  superscripts  denote  the 

component  hypothesis  sets.  Independence  of  all  posterior  distributions 


Wk»(t) 


U5) 


-Il¬ 


ls  implied  by  the  conditions 


(a) 

Vr  Vl0) 

-  ’>» 

V0) 

(15) 

(b) 

vj  :  (t) 

i  <\  h >  > 

=  pty^t)!^) 

(17) 

or 

(t) 

•  tHk  V> 

=  p(yj(t)!H£) 

That  is,  the  prior  distribution  has  the  component  hypotheses  independent, 
and  subsequent  observations  change  the  distribution  on  either  or  Hj  ,  but 


never  both. 


Definition:  A  Cartesian  obscurity  function  v  (tt)  is  of  the  form 


c  -*>  1  -a  2  -» 

v  (ir)  =  v  (Tf)  +  v  (ir) 


K  K 

vV)  =  Z  Z  b  v 

k  =l  k  =1  1'  2  £=1  “l*  "-2 


1  (7rk  £  *k  £> 


1/2 


2  L  L  2  K 
v  (it)  =  E  E  b,  ,  r  ,  ) 

£  =1  £  *1  *1  x2  ^  K  X1  K  *"2 

12  k=l 


1/2 


(18) 

(19) 

(20) 


Independence  (15)  then  implies 


1  _i  K  ?  l  1  2  1/2 

V1(’T)  =  E  Z  b  .  0C  TT  ) 

k  =1  k  =1  1  2  1  *2 


(21) 


and  similarly  for  v  (ir) .  The  Cartesian  obscurity  function  thus  additively 
decomposes  into  separate  obscurity  functions  defined  on  the  two  component 
sets. 

This  result  will  be  used  in  an  example  in  section  VI. 

Conclusion:  The  above  discussion  has  dwelt  on  the  interpretation  and 
structure  of  the  obscurity  function.  Assuming  it  provides  a  reasonable 


measure  of  the  poorness  of  the  state  of  information  defined  '  y  the  posterior 
distribution  if(t) ,  the  objective  of  minimizing  its  sum  over  some  time  horizon 
is  appealing.  The  optimization  problem  is  thus:  select  the  sequence  of 
sensors  j (t)  from  which  observations  are  taken,  to  minimize  the  functional 

T  _» 

E  {  I  V  (-TT  (t)  ) }  (22) 

t=l 

where  the  expectation  is  with  respect  to  observations  y, (t) .  The  optimisation 
is  subject  to  the  initial  conditions  (1) ,  dynamics  (4) ,  and  distributions  on 
observations (2) . 


Ill .  Reduction  to  a  Continuous  Deterministic  Optimal  Control  Problem 

The  problem  stated  above  is  a  stochastic  optimization  problem,  where  the 
original,  imperfectly  observable  state  H  has  been  replaced  with  the 
conditional  probability  tt,  which  can  be  determined  exactly.  State  transitions 
are  still  stochastic,  due  to  the  appearance  of  y(t)  in  (4),  but  possess 
a  Markov  property.  This  is  a  standard  approach  [12]  to  dealing  with  this 
type  of  problem;  the  next  step  might  be  to  use  dynamic  programming  to  obtain 
a  feedback  solution,  where  j (t)  would  be  selected  on  the  basis  of  TV  (t) . 

Due  to  the  lack  of  success  of  this  approach  in  producing  implementable 
solutions  for  the  general,  multiple  hypothesis  problem  consider  a  less 
ambitious  goal:  finding  the  optimal  open  loop  schedule  (i.e.  select  the 
best  sequence  of  sensors  based  only  on  the  prior  distribution) .  Not  only 
are  solutions  of  this  form  applicable  in  some  cases  where  feedback  solutions 
cannot  be  implemented,  but  they  can  be  used  as  open  loop  feedback  solutions 
where  the  entire  schedule  is  effectively  recomputed  at  each  time,  using  TT(t) 
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‘3 

as  the  prior  distribution,  and  tne  first  selection  given  by  that  schedule  is 
implemented. 

Reduction  to  a  deterministic  problem  in  this  case  involves  performing 
the  expectation  in  (22).  This  results  in  a  discrete  time  problem  and  a 
characterization  of  each  sensor  by  a  set  of  coefficients,  After  some 
discussion  of  the  interpretation  of  these  coefficients,  a  continuous  time 
approximation  will  be  constructed  for  further  study  (or,  the  continuous 
time  problem  can  be  found  directly  in  the  case  of  continuous  time  observations) . 

After  a  final  detail,  where  the  control  set  is  extended  from  discrete  points 
to  a  connected  set,  the  equivalent  deterministic  optimal  control  problem  is 
presented. 


Reduction  to  Deterministic  Dynamics;  The  form  of  the  obscurity  function  was 
selected  for  its  qualitative  properties  and  because  of: 

Theorem  2:  Functions  of  the  form  (7)  are  eigenfunctions  of  the  expectation/ 

Bayes  update  operation  and  the  associated  eigenvalue  is  completely  determined 
by  sensor  characteristics. 

Proof:  Let^ 

v^ifct+l),  t+1)  »  xi(t+l)  (tt^  ir2^1/'2 


Then 


V  (lT(t),  t)  ■  E  {v,  (tT  (t+1)  ,  t+1)} 

yCt+1) 


(23) 


with  it  (t+1)  given  by  (4).  Substituting, 


vi(ir(t),  t) 


f 


/p(y\  (t+1)  ]h.)  it .  (t)  p(y .  (t+1)  |  H2)7r2(t) 

p(74(t+l)  x^t+l)^ - 3 - 3 - ~ j - 3 - — 


'j 


<P (Yj (t+1) ) ) 


) 


1/2 


a^tt+l)  xi(t+l)  (TT^t)  m2(t)) 


1/2 


(24) 


**  For  notational  simplicity  here,  assume  the  iC^  term  of  the  obscurity  function 
involves  and  H2  • 


,1... 
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where  j  is  the  sensor  selected  for  t+1  and 


a. . (t+1)  = 


_» 

j  ^  (p(Yj (t+1) |hx)  p(y. 


(t+1) |h2))1/2  dyj 


(25) 


By  (24),  v^  (ir  (t+1)  ,t+l)  is  an  eigenfunction  of  the  update  (23),  with  <X^  (t+1) 
is  eigenvalue. 


Now,  for  a  fixed  sequence  of  sensors 
j  «  (j(t),  t  -  1,...,t)  ,  define  the  expected  cost  to-go  at  time  t  with 
conditional  distribution  it  (t)  as 


V~  (ir\t) ,  t)  -  E{  Z  v(tUs))  ) 
D  s-t+1 


(26) 


where  V~(tt(T),T)  -  0  at  the  terminal  time.  The  key  result  is  then 


Theorem  3:  At  each  time  t,  the  cost-to-go  takes  the  form 
-a  N 

Vt  (it  (t) , t)  -  Z  x.  (t)v.  (IT) 

3  1-1  1  i 

where 

x^(t)  -  ot^  (t+1)  x^ (t+1)  + 

Xi(T)  -  0 


(27) 


(28) 

(29) 


Proof:  By  reverse  induction.  At  t  *  T,  (29)  implies  the  cost-to-go  is 
uniformly  zero.  Assume  (27)  holds  at  time  t+1, so 
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VT  (ir  Ct) , t)  =  E  {Vr  (tt  (t+1) ,  t+1)}  +  b;.  vi(1T(t))  by  (8,22) 

N 

=  E  {  z  xi(t+l)  (t+1) )  >  +  bt  vt(ir(t))  by  (27) 

i=l 


Z  a.  ,  (t+1)  x^(t+l)  vi(tr(t))  +  (ir  (t) )  by  Thm  2 

i=l  13  1 


I  (ai .  (t+1)  xi(t+l)  +  b^)  v^mtt)) 
i=l  J  - 


and  v^(ir)  is  of  the  fundamental  form  (7). 


This  gives  a  deterministic  linear  dynamical  problem  (28,29)  with  states 
x^(t)  representing  the  amplitudes  of  a  finite  number  of  modes  of  the  cost- 
to-go  function  excited  by  the  terms  of  the  obscurity  function.  The  co¬ 
efficients  _.  (t)  represent  the  decay  of  x.^  (t)  when  sensor  j  is  selected 
at  time  t,  and  the  driving  terms  b^  representing  the  relative  importance 
of  each  term.  Moreover,  the  x^ (t)  are  truely  states  as  their  evolution 
depends  only  on  selections  j  made  betweeen  t  and  T,  although  this  property 
holds  in  reverse  time. 


Corollary  3a:  The  total  cost  is 


N 

V  (7T(0),0))  -Ex  (0)  V.  (IT  (0)  ) 

j  i-1 


(30) 


Proof:  Immediate  from  Theorem  3  when  t  -  0. 


If  (known)  parameters  c^  are  defined  as 


C.  =  V;.(TT(0)) 

then  the  equivalent  deterministic  optimal  control  problem  is  to  select  j 


(31) 


t  - 
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to  minimize 


subject  to 


E  x  <0)  c. 
i-1  A 


X^t-1)  =  CL^tt)  +  bi 


(32) 


(33) 


x^T)  =  0 


interpretation  of  the  g  „ :  These  parameters  measure  the  ability  of  sensor  j 
to  contribute  to  the  reduction  of  each  term  of  the  obscurity  function. The 
set  { cUj  |i  =  describe  the  information  gathering  ability  of  j 

in  all  directions  which  are  contained  in  v(tr).  For  example,  sensor  1  may  be 
able  to  distinguish  from  H 2  and  but  not  between  the  latter,  while 
sensor  2  only  separates  from  H^.  The  information  from  each  sensor  alone 
is  incomplete:  the  set  above  paves  the  way  towards  a  geometric  interpretation 
of  information. 

Gross  properties  of  the  a. .  are 

13 


Theorem  4 :  For  all  i , j , t , 


0  <  a.  .  (t)  <  1 
-  jo'  - 


(34) 


with  the  lower  limit  obtained  iff  it  is  possible  to  completely  eliminate  one 

u 

of  the  hypotheses  in  vi  with  any  single  observation  y ^ (t) ,  and  the  upper 

_»  _ 
iff  y ^  (t)  is  independent  of  the  hypotheses  in  v^  (it)  . 


_J  _ |  _ fr 

Proof:  Since  p Cy^ (t) | H^)  >_  0  for  all  ♦  for  all 


l 


s* 


^p(yj(t)|Hk  )  p(y\  (t)  Ift^  ))1/2dy  >  o 

m  2 


(35) 
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with  equality  iff 

p(y  (t)|H  )  P(y  (t)|H  )  =  0  (36) 

3  1  3  *2 

for  all  y  (t) ,  i.e.  iff  the  set  of  y ^ (t)  which  may  result  when  H2  is  true 

_» 

is  disjoint  from  that  possible  when  is  true,  and  hence  y^ (t)  provides 
perfect  information  to  distinguish  between  them.  Since  also 


(37) 


the  integral 


with  equality  iff 


(38) 


p(y  (t)  | ft  )  -p(y.(t)|lL  )  (39) 

3  *1  3  *2 

for  all  y . (t) , 

3  O 

Thus,  qualitatively  speaking,  good  schedules  use  sensors  where  the 
are  3mall  for  terms  where  c^  or  are  large. 

In  preparation  for  the  transition  to  continuous  time,  introduce 


Definition;  The  clarification  coefficient  of  sensor  j  with  respect  to 
v^  (tA  is  ai .  (t) 

a^tt)  =  -In  j  (t)  (40) 

Corollary  4a:  Clarification  coefficients  are  nonnegative  and  unbounded 
with  equality  to  zero  holding  iff  the  sensor  produces  outputs  which  are 
independent  of  the  hypotheses  of  the  associated  term. 


T 
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Proof:  Properties  of  In.  D 

Appendix  B  contains  formulae  for  the  clarification  coefficients  for  two 
common  observation  processess:  Poisson  and  Gaussian. 

Reformulation  in  Continuous  Time:  The  remainder  of  this  section  deals  with 
improving  the  analytic  properties  of  the  problem  by  replacing  the  discrete 
time  and  control  sets  with  continuous  equivalents.  The  problem  as  posed 
(32,33)  can  be  solved  using  the  discrete  time  minimum  principle  [13] ,  but 
the  solution  has  implicit  properties  which  are  less  cumbersome  in  a 
continuous  time  framework. 

Consider  the  formal  continuous  time  analog  of  (32,33):  minimize 


N 

£  c,  x. (0) 
i=l  1 


(41) 


with 


dx. 

x 


d(-t) 


*  -aij  (t)  x  (t)  +  b^t) 


x.,(T)  *  0 


(42) 


where  again  the  dynamics  appear  in  reverse  time.  Integrating  (42)  from 
time  t  to  time  t  -  5  gives  approximately 


x^t-5)  -xi<t)  *  -ai;.(t)  xi(t)6  +  b^t) 


(43) 


x^(t-6)  =  (1  -  ai^(t)6)  x^t)  +  b^ttjS 


~  e 


-a^  (t)6 


(t)  +  b^  (t)5 


(44) 


provided  5  is  sufficiently  small  that  second  order  terms  can  be  neglected, 
i.e. 


a^  (t)6  <<  1 


(45) 
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Setting  <S  =  1 

x(t-l)  ~  a^tt)  xi(t)  +  b^t)  (46) 

Provided  the  a.  (t)  are  the  clarification  coefficients  (4)  and  a. . (t)  the 
ij  13 

eigenvalues  (25),  (41,42)  may  be  valid  approximations. 

The  approximations  require  (45)  to  hold  when  <S  is  the  unit  discrete  time 
interval.  If  it  is  invalid,  decreasing  6  is  suggested  -  i.e.  increase  the 
discrete  time  sample  rate.  The  principal  effect  of  this  is  to  create  more 
opportunities  for  changing  the  sensor  selection  -  i.e.  making  switch  times 
into  more  continuous  variables.  As  5  -*■  0,  the  approximation  becomes  exact 
this  can  be  shown  specifically  for  continuous  time  Gaussian  processes  either 
using  [14]  or  more  direct  techniques  (Appendix  C) . 


Convexification  of  Control  Variable:  In  the  problem  thus  far  there  have  been 
a  discrete  set  of  sensors  from  which  to  select.  It  will  be  convenient  to 
convexify  this  set  by  introducing  the  M  control  variables  u_,  (t) ,  which 
specify  what  fraction  of  am  infinitesimal  cycle  is  devoted  to  each  sensor  j . 
Thus 

M 

Z  u.  (t)  =  1  u.  (t)  >_  0  (47) 

1-1  1  3 

are  the  constraints  which  admissible  controls  must  satisfy. 

With  this  interpretation,  (44)  becomes 

-Z  a„ (t)Uj (t)5 

x^(t-S)  -  e  ^  x^(t)  +  b^6  (48) 


or. 


as  6 


As  the  discrete  sample  rate  goes  to  zero,  this  expression  is  exact  for  all 
stationary  processes,  as  well  as  for  nonstationary  Gaussian  and  Poisson 
processes.  It  provides  a  piecewise  linear  approximation  for  other  sampled, 
nonstationary  independent  increments  processes;  however,  the  sequel  will 
assume  ct.,(t)  to  be  twice  differentiable  and  thus  a  more  advanced  approxi¬ 


mation. 


ij 


such  as  splines,  may  be  necessary. 


r 
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where 


dPir  =  _ai(u)  x±(t)  +  bi 


a.  (u)  *  I  a. . (t)  u,  (t) 

j=l 


(49) 


(50) 


This  convexification  of  the  control  set  allows  the  above  interpretation 
of  polling  sensors  with  being  the  fraction  of  time  devoted  to  sensor  j . 

Mixed  controls  (some  £{0,l})  do  arise  in  the  optimal  solution.  Were  a 
solution  attempted  without  convexification,  the  optimal  solution  would  still 
be  forced  to  achieve  this  mixture  by  infinitesimal  "time  sharing" .  In 
practice,  either  this  polling  can  be  approximated  or,  in  open  loop  feedback 
uses,  it  will  almost  never  occur  as  the  set  of  t^(0)  for  which  it 
is  initially  required  is  of  measure  zero. 

Reversal  of  the  Time  Index:  Finally,  the  reverse  dynamics  that  naturally 

arose  in  (32,33)  and  (40,41)  are  a  notational  nuisance;  replacing  the  time 
variable  t  with  another  t" 

t'  *  T  -  t  (51) 

yields  an  identical  problem  more  in  line  with  standard  optimal  control  problems. 
The  only  caveat  is  that  the  solution  to  the  resulting  problem,  u*(f*),  is 
the  reverse  of  the  optimal  schedule. 

Conclusion;  This  section  has  reduced  the  original  problem  (1,4,8,22)  to 
an  equivalent  problem:  minimize 


Z  c.  x.  (T) 
i=l  1 


Ci  ~  vi(^ 5 


(52) 


r  ■ 
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with 

x^t")  =  -ai(u(t'))  xi(t')  +  b±  x^O)  =  0  (53) 

-A, 

The  system  dynamics  a^(u(t  ))  will  always  be  nonnegative,  and  larger  values 
correspond  to  greater  clarification  by  the  selected  sensor. 

IV.  Optima)  Solution  of  Reduced  Problem 

Here  the  above  problem  is  interpreted  using  the  Pontryagin  minimum  principle 
and  the  geometric  structure  of  the  solution  emerges.  The  first  section  intro¬ 
duces  the  type  of  results  obtained  by  examining  the  binary  hypothesis  problem 
where  only  one  dimension  of  obscurity  exists.  After  stating  necessary  conditions 
which  the  optimal  schedule  must  satisfy  and  deriving  some  of  its  properties, 
the  interpretation  of  these  conditions  in  terms  of  sensor  clarification  co¬ 
efficients  provides  some  preliminary  tests  for  eliminating  sensors  from 
consideration.  Further  examination  of  singular  (mixed)  control  arcs  yields 
more  basic  structure  of  the  schedule  as  well  as  a  classification  of  problems 
in  terms  of  the  sensor  sets.  These  will  be  the  general  results;  section  IV 
will  exploit  the  necessary  conditions  to  compute  the  optimal  schedule. 


Preview  -  Binary  Hypothesis  Testing:  This  special  case  illustrates  some  of 

the  conclusions  that  can  b  e  drawn  about  optimal  schedules .  The  obscurity 

function  between  the  two  hypotheses  H^  and  H 2  has  one  term 

,  ,1/2 

V1  =  (7rl  1T2J 


so  the  continuous  oroblem  to  be  solved  is  to  minimize 


Cj.  x^T) 


with 


x,  (t)  =  a, (u(t) )  x. (t)  ♦  1 

JL  x  ± 


xxiO)  -  0 


i 
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Here  the  solution  is  obvious:  choose  u(t)  to  maximize  the  coefficient 
a^(u(t))  at  each  time  t.  This  corresponds  to  selecting  the  sensor  with 

maximal  instantaneous  clarification  coefficient  a^ (t)  at  each  t. 

-a 

Thus  u  is  chosen  to  maximize  (a  function  of)  a^Ct)  at  each  time,  and 

the  selected  sensor  may  vary  as  a„  (t)  changes  with  time.  No  mixed  controls 

are  required  here,  but  multiple  hypotheses  with  multiple  terms  in  v(tt)  will 

induce  a  directionality  which  requires  mixing. 

-a 

This  can  be  extended  to  the  case  where  u  adjusts  continuous  parameters 
internal  to  a  sensor;  see  Appendix  D  for  an  example  application  in  data 
compression. 


General  Necessary  Conditions:  Necessary  conditions  for  the  dynamic  optimi¬ 
zation  problem  (52,53)  can  be  obtained  from  the  Pontryagin  minimum  principle 
[15,16].  They  are  summarized  in 


Theorem  5:  The  optimal  solution  u(t)  to  (52,53)  satisfies: 


N 

_» 

2 

z. (t)  a.(u*(t))  >  2  z.  (t)  a. (u(t) ) 

for  all  u(t) 

(54) 

1=1 

1  1  ~  i=l  1  1 

z^t) 

=  q^t) 

z^  (0)  =  0 

(55) 

• 

— » 

q^t) 

=  ai(u*(t))  qi(t) 

q.  (T)  *  b.  c. 

U  ii 

(56) 

Proof:  see  Appendix  E.  O 

Because  of  their  associations  with  variables  in  the  proof,  (54)  will 
be  referred  to  as  the  Hamiltonian  condition,  (t)  as  states,  and  q^(t)  as 
costates.  In  addition,  introduce  the  M  vectors 

a*(t)  =  Ca.j(t)  a^t)  . aNj(t)]T  (56) 


1 


-23- 


of  clarification  coefficients  for  each  sensor  and  the  NxM  composite  matrix 


so  that 


A(t) 


a (u  (t) )  ^  ^A_(t)  u(t) 


is  the  vector  of  system  coefficients  a^(u(t)). 

The  necessary  conditions  can  then  be  rewritten  as 


(57) 


(58) 


<z(t),  a(u(t))> 


z(t)  *  q  (t) 


<z (t) >  a(u(t))> 


z  (0)  =  0 


q (t)  -  a(u(t) )  ‘  q(t)  q(T)  -  b*  c 


.  N 


where  <•,*>  is  the  standard  inner  product  in  H  and 
multiplication. 


for  all  u(t)  (59) 
(60) 
(61) 

*  is  componentwise 


Properties  of  z(t)  and  q(t):  Some  properties  of  the  solution  to  (59-61) 

are  immediate.  Note  that  these  describe  a  2N  dimensional  differential 
equation  with  boundary  conditions  at  both  ends  of  the  time  horizon.  The 

solution  approach  here  will  be  to  focus  on  the  initial  conditions  for  z 

— 1  — » 

and  on  the  interrelationship  between  the  dynamics  of  z  and  q. 


Theorem  6:  z(t)  and  q(t)  have  the  following  properties: 

a)  q (t)  >  0 

— & 

b)  q (t)  is  monotonically  nondecreasing 

c)  z (T)  is  monotonically  increasing 

d)  if  q (0)  *  TT  leads  to  a  solution  of  all  conditions  except 
q^T)  =  b  •  c,  then  so  dees  q(Q)  ■  XqQ  for  all  positive  X  . 
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Proof:  a}  ftss^aae  qtt^)  a(U  (TJ)  >  0  for  all  u(t)  (from  theorem  4), 

_*  _4 

so  q(t)  <  0  for  t  >  t  ,  and  hence  q{t)  <  0  for  t  >  t  .  But 
—  —  o  —  —  o 

-»  _v  _J 

q(T)  =  b  •  c  >  0  in  contradiction. 


b  and  c)  immediate  from  a)  and  theorem  4. 

d)  If  (z(t),  q(t))  satisfy  all  conditions  except  q(T)  =  b  •  c, 
then  (yz(t),  yq(t))  also  do  provided  y  >  0.  O 

-»  I] 

Thus  z (t)  moves  outward  into  the  positive  sector  of  B  ,  from  the 

u  _» 

origin, and  q(t)  moves  outward  to  approach  q(T). 

Properties  from  the  Hamiltonian  Condition:  The  Hamiltonian  condition  (59) 

provides  the  key  to  the  geometrical  structure  of  the  problem.  The  vectors 

of  clarification  coefficients  a_.  (t)  describe  M  points  which  are  vertices 
tt  -* 

of  a  polytope  .in  K  .  The  control  u(t)f  by  taking  a  convex  combination  of 

— »  —k  _» 

the  a..  (t) ,  allow  a(u(t))  to  be  selected  anywhere  in  the  interior  or  on  the 
boundary  of  this  polytope.  These  candidates  for  a(u(t))  will  be  called 
the  control  pc ly tope. 

The  condition  (59)  implies  that  u  (t)  must  select  a  point  in  this  control 
polytope  which  has  a  projection  on  z(t)  at  least  as  great  as  any  other 
point  in  the  polytope .  It  follows  immediately  that  a  (u  (t) )  must  lie  on 
the  boundary  of  the  control  polytope,  and  in  fact  an  even  stronger  condition 
holds. 


Theorem  7:  Sensor  j  will  not  be  used  at  time  t  for  any  choice  of  b,  c 
or  t if  there  exists  a  "onnegative  vector  r  such  that 


a.  (t)  <  A (t)  r(t) 
^o 
n 


(62) 


E  i  j  (t)  =  1 


j-1 


r  (t)  =  0 


(63) 


F 
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l 


Proof:  Assume  (62,63)  hold.  Then  for  any  nonegative  z, 

-i  -o  -i  -* 

<z,  a  ^  (t)>  <_<z,  a(t)  r  (t)>  =  <z,  a(r(t)>  (64) 

o 

so  u(t)  =  r(t)  provides  at  least  as  large  a  value  of  the  Hamiltonian  as  does 
a^  .  Since  indeed  z(t)>^  0  for  all  time,  will  never  be  selected  at  time  t. 

The  converse  of  this  theorem  is  also  true,  but  will  not  be  proven  here. 

The  import  of  this  theorem  is  that  it  provides  a  convenient  test  for 
determining  whether  each  sensor  falls  into  the  following  class. 

Definition:  A  sensor  is  superfluous  at  time  t  if  it  will  not  appear  in  the 
optimal  schedule,  for  any  problem,  at  time  t.  It  is  completely  superfluous 
if  it  will  not  appear  in  such  a  schedule  at  any  time. 

Henceforth,  consider  only  the  set  of  nonsuperfluous  sensors  at  each  time  t. 

Returning  to  the  basic  geometry  of  the  Hamiltonian  condition,  the 
following  concepts  are  helpful: 

Definition:  Two  sensors  are  adjacent  if  the  line  segment  connecting  their 
vectors  of  clarity  coefficients  lies  completely  in  the  convex  hull  of  the 
control  poly type. 


Definition:  A  face  of  the  control  poly type  is  a  collection  of  mutually 
adjacent  sensors.  Paces  can  be  of  any  dimension,  from  1  (a  single  sensor) 
to  N  -  1. 

Now  we  find 

Theorem  8:  The  solution  to  the  Hamiltonian  condition  (59)  is  unique  unless 
z  is  normal  to  the  line  connecting  two  adjacent  sensors. 


Proof:  (59)  i.3  the  maximization  of  a  piecewise  linear  function  of  u 


subject  to  a  linear  constraint  and  nonnegativity  requirements.  It  is  well 
known  that  this  maximum  occurs  at  a  vertex  of  the  function.  It  is  unique 
unless 


<zf 

for  two  vertices 


a.  (t)>  =  <z,  a.  (t)> 

H  32 

j1  and  j2,  i.e. 


(65) 


<z,  a.  (t)  -a,  (t)>  *  0  (66) 

Jl  J2 


Also,  the  maximum  occurs  at  any  convex  combination  of  a.  and  a.  . 

_>  31  32 
(66)  implies  z  is  normal  to  the  line  connecting  the  two  vertices;  the 

convexity  condition  requires  that  the  segment  connecting  the  vertices  also 

be  on  the  boundary  of  the  control  poly tope,  i.e.  that  j  and  j„  be  adjacent. 


This  defines  switch  curves  from  one  sensor  to  an  adjacent  one;  u  (t) 

takes  on  different  valu  s  (and  is  pure,  selecting  only  one  sensor)  on 

opposite  sides  of  the  hyperplane  defined  by  (66) •  All  switch  curves  are 

linear  manifolds  including  the  origin,  hence  the  regions  between  these 

2N 

curves  are  sectors  of  K  (Figure  3) .  Since  u  is  constant  within  each 

_» 

sector,  the  overall  dynamics  of  (z  (t) ,  q(t))  are  piecewise  linear,  with 
the  pieces  being  sectors.  As  z(t)  moves  from  sector  to  sector,  it  selects 
a  sequence  of  sensors,  where  each  sensor  follows  another  which  is  adjacent 
to  it.  This  property  helps  reduce  the  number  of  possible  sensor  sequences  - 
any  schedule  with  nonadjacent  sensors  in  succession  is  not  optimal. 


What  happens,  though,  if  z(t)  does  not  move  from  sector  to  sector,  but 
stay3  on  the  boundary  separating  two  or  more  sectors? 


Singular  Control  Arcs:  When  z(t)  stays  on  a  switch  curve  for  a  nonzero 


interval  of  time,  the  Hamiltonian  does  not  provide  a  unique  choice  for  u  (t) 
for  that  interval.  However,  the  requirement  that  z(t)  stay  on  the  curve 
for  the  continuous  interval  implies  additional  conditions  [3.71  namely 


d  -»  -i  -4 

-  <z(t),  a.  (t)  -  a.  (t)>  *  0  (67) 

dt°  31  32 

This  condition  can  be  exploited  to  find  partial  optimal  paths  at  the  start 
of  an  interval. 

Specifically,  consider  singular  control  between  adjacent  sensors  1  and  2. 
Define 

n12(t)  *  ^  (t)  -  a^ (t)  (68) 


as  the  normal  to  the  switch  hyperplane  between  the  two  sensors.  The  selection 


between  them  is  then  made  as 

<  z(t), -rt12(t)>  > 

0 

choose  1 

*4 

<z(t),  n12(t)>  » 

0 

ambiguous 

<2(t),  n12(t)>  < 

0 

choose  2 

In  order  to  maintain  the  ambiguous  case, 


<  q(t),  n12(t)>  +  <  z(t),  n12(t)>  ■  0 
<  a(t)  •  q(t) ,  n^ft)  >  +  2<  q(t)  ,  iv^  (t)  >  +  <z*( t)  ,  n12(t)  > 


(70) 

(71) 


are  necessary  where  a(t)  is  the  vector  of  coefficients  resulting  from  some 

_>  — > 

convex  combination  of  a^(t)  and  a2(t) 


For  states  and  costates  which  satisfy  (69-71) ,  the  behavior  of  their 
trajectories  and  the  conditions  for  singular  control  are  surmnarized  in 
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Theorem  9:  If  (x ) ,  q  (t  )  are  the  trajectories  resulting  from  application 
of  sensor  1  for  a  small  period  of  time  following  t,  (t  e  [t , t-HS  ] )  and  z2(t), 
q2(t)  likewise  for  sensor  2,  both  with  initial  conditions  at  t  satisfying. 
(69,70),  then 


a) 

<21(t)  , 

n^2  (t)> 

> 

<z2  (t)  ,  n12(x)> 

(72) 

b) 

<2^  (T)  f 

nu(T)> 

> 

0  iff 

<  ax (t)  q(t),  n12 (t) >  +  2  <q(t),  n12(t)>  +  <z(t),  n12(t)>  >  0  (73) 

c)  <  z2(x),  n12(x)>  1  0  iff 

<a2(t)  q(t)  ,  n  (t)>  +  2  <q(t)  ,  n*12(t)>  +  <z(t),  n12^t)>  —  °  *74) 


d)  Singular  control  arise  if  and  only  if  both  (b)  and  (c)  hold. 


Proofs  The  proof  is  primarilly  algebraic  and  deals  with  the  evolution  of 
the  projection  of  z  onto  the  normal  to  the  the  switch  hyperplane.  Details 
are  contained  in  Appendix  F.  ° 

Theorem  9  provides  a  way  to  determine  whether  singular  controls  may  exist 
at  a  specific  point?  as  such  it  is  rather  tedious  in  general.  A  broader 
condition  is: 

5 

Corollary  9A:  A  sufficient  condition  for  (73)  to  hold  at  every  point  on  a 
singular  are  along  the  switch  curve,  at  time  t,  is  that  there  exists  a 
y  >  0  such  that  either 
y  n12(t) 

Ta1(t)*n12(t)  +  2  ^(t)  >  n^tt)  (76) 

This  is  also  necessary  if  the  statement  is  to  hold  for  all  such  points; 
this  will  not  be  proven  here. 


5 
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or 


b)  Yr>12(t)  >_  ~n12(t) 


and 


(77) 


Ya1(t)*n12(t)+2  n12't)  -”n12*T* 


(78) 


A  similar  condition,  with  a2 (t)  replacing  a^  (t)  and  inequalities  reversed, 
guarantees  (74)  at  every  point. 


Proof  :  Assume  (a).  Sines  z(t)  >  0,  (75)  implies 


Y<  z  (t) ,  n12  (t)  >  >_  <z  (t) ,  n12(t)> 

Since  q(t)  >_  0  also,  (76)  yields  a  similar  result  which,  when  combined 
with  (79) ,  gives 

2 

~2  '  '  "12' 


(79) 


Y  '^“2<  *(t)  /  n^2 (t) >  >_  <  afft)  ,  n*12(t)> 

dt 


—  dt 


(80) 


By  (70)  both  of  these  are  nonnegative  at  each  point  on  the  curve;  since 
Y  >  0,  (73)  must  hold. 

Assume  (b) .  Identical  arguments  give 

Y  — 2  <  2  (t) '  n12  (t>  >  -  "  ft  <Z  (t)  '  ni2  (t)  51 
dt 


(81) 


and  (70)  again  gives  the  latter  as  equal  to  zero  along  a  singular  curve, 
so  (73)  still  holds. 

The  condition  for  (74)  is  proven  in  the  same  way.  q 

This  provides  a  quick  test  to  see  if  singular  controls  can  be  maintained 
for  arbitrary  periods  along  a  switch  curve.  Failure  of  the  conditions  of 
corollary  9A  to  hold  require  more  detailed  examination  for  singularity  using 


theorem  9. 
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A  final  point  on  the  topic  of  singular  controls  is  to  note  that  it  is 
possible  to  maintain  singular  sires  using  more  than  two  sensors.  Since  this 
requires  z(t)  to  stay  on  the  switch  curve  between  each  pairs  of  sensors, 
it  is  necessary  that  all  sensors  involved  be  adjacent.  The  conditions  for 
the  existence  of  singular  controls  are  the  union  of  the  pairwise  conditions 
above;  for  computation  of  the  actual  mixture  of  sensors,  see  appendix  F. 

Thus  singular  controls  can  exist;  they  will  form  the  backbone  of  the 
solution  to  a  well  structured  class  of  problems  discussed  in  section  V. 


Crossing  of  Switch  Curves:  While  theorem  9  gives  conditions  for  singular 
controls,  it  also  provides  a  great  deal  of  information  concerning  how  and 
when  various  switch  curves  may  be  crossed.  Successive  sensors  in  an  optimal 
schedule  must  be  adjacent  to  one  another  and  this  limits  the  set  of  candidate 
orderings;  exploitation  of  theorem  9  allows  further  limitations  to  be 
considered. 


The  focus  of  this  development  is  on  the  linear  manifold  in  3R 


2N 


<z(t),  n12(t)> 


<  q(t),  n12(t)>  +<z(t),  n12(t)> 


2n 


(82) 

(83) 


Being  linear,  this  manifold  separates  B  into  four  subsets  corresponding 
to  the  possible  inequalities  replacing  «  in  (82,  83) .  The  partition  (82) 
in  z  -  space  is  the  switch  curve;  the  companion  partition  in  q  -  space  (83) 
determines  which  direction  z* will  be  driven  from  the  switch  curve  by  the 

-A 

current  value  of  q  .  For  simplicity,  consider  only  the  half  space 


<  z(t) ,  n12 (t)>  >  0 


(84) 


Identical  results  follow  for  the  other  halfspace  using  n (t) ,  and  theorem  9 
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dealt  with  the  case  of  equality. 

_»  _»  _s 

Suppose  z(t)  is  in  the  sector  where  (t)  is  optimal,  and  q(t)  in  the 

half  space 


<  q(t) ,  n12<t)>  +  <  z (t) ,  n12(t)>  >_  0 


(85) 


Since  this  implies 


—  <  z(t),  n12(t»  >  0 


(86) 


z(t)  must  be  moving  further  into  the  interior  of  the  sector  and  away  from 
the  switch  curve. 

If  q (t)  is  in  the  open  half  space  complementary  to  (85),  then 

^  <  z(t),  «*12(t)>  <  0  (87) 

and  z ( t)  is  approaching  the  switch  curve .  If  conditions  can  be  found 
to  quarantee  that  q(t)  lies  in  one  or  the  other  of  these  half  spaces  for 
all  time,  then  an  important  characterization  of  possible  switches  is 
obtained . 

This  involves  conditions  under  which  q(t)  crosses  the  boundary  (83) 

_»  -A 

when  a1(t)  is  applied.  Suppose  (85)  holds  but  q(t)  reaches  the  surface 
(83) .  If 


f^-  <  q  (t) ,  rf12  (t)  >  +  ft  <  Z  '  ni2  (t)  >  —  0 


(88) 


q  will  not  move  off  of  the  boundary  and  enter  the  interior  of  the  complement 
(85).  Hence  (86)  holds,  and  hence  a2(t)  will  not  become  optimal.  Fortunately, 
(68)  is  the  same  as  (73)  for  which  a  sufficient  condition  was  given  in 
corollary  9A. 

Definition:  The  switch  surface  between  two  adjacent  sensors,  say  1  and  2, 


?  ■ 


is  closed  under  sensor  1  if  q(t)  cannot  leave  the  sector  (85)  when 


sensor  1  is  being  applied. 

Corollary  9B:  A  sufficient  condition  for  the  surface  between  1  and  2  to 
be  closed  is  (75-78) . 

_»  —i 

Proof:  Above  (note  75-7?)  do  not  involve  either  either  z(t)  or  q(t)  explicitly). 

The  principal  implication  of  this  development  is  that  if  sensor  1 

_  _» 
is  activated  by  z(t)  entering  the  interior  of  its  sector,  q(t)  satisfies  (85) 

at  that  point,  and  the  surface  between  1  and  2  is  closed,  then  sensor  2 

cannot  follow  sensor  1  in  an  optimal  sequence. 

Corollary  9C:  A  surface  which  is  closed  in  both  directions  supports  singular 
arcs. 

Proof:  Closure  implies  that  the  application  of  each  control  represented 

.jk 

on  either  side  of  the  surface  will  drive  a  q(t)  on  the  surface  into  the 
respective  interiors  of  the  half spaces.  Hence  the  conditions  (73,74)  of 
theorem  9  are  met  and  singular  controls  are  possible.  Q 

Thus  there  is  an  interesting  relationship  between  closed  surfaces 
and  singular  controls:  while  a  surface  closed  in  both  directions  precludes 
scheduling  1  before  2,  or  vice  versa,  and  hence  cannot  be  crossed,  it  does 
support  singular  arcs  which  follow  the  surface  and  branch  at  some  point 
into  one  or  the  other. 

Example:  Consider  the  stationary  clarification  vectors 

-  U  2  3] 

■  [2  3/2  1] 


[5/4  7/4 


13/4) 
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The  surface  between  and  a2  is  closed,  since 

nj  *  [-1  .1/2  2] 

and  (75,76)  satisfied  with  V*  2/3.  (76)  becomes 


"l  .  •  -1 

’-1 

2/3 

2  .  1/2 

> 

1/2 

3  •  2 

2 

. 

It  is  closed  in  the  opposite  direction  also,  since  n^2  =  ~n12  and 
Y  *  1  satisfies 


Y  a2  *  n21  -  n21 


Routine  verification  shows  that 


a  =  1/3  a^  +  2/3  a2 


(99) 


(90) 


does  indeed  maintain  a  singular  arc  as  predicted  by  corollary  9C. 

-i  — > 

However,  the  surface  between  a^  and  a^  is  not  closed  in  either 
direction.  Note  that  in  the  stationary  case,  the  partition  of  q  space  is 
independent  of  z(t)  as  the  second  term  in  (85)  drops  out.  (85)  then  requires 


qi  + 


Applying  theorem  9  at  each  point,  of  this  surface  gives 


(91) 


<Vq  {t)'  r*13>  — ■  ° 

iff  qx  1  q3 

(92) 

-i  -a  -i 

<  a3  '  q(t)'  VI0 

iff  qL  <  3q3 

(93) 

Figure  4  shows  a  view  along  this  switch  surface;  the  upper  arrows  indicate 
motion  towards  or  away  from  the  surface  when  z  calls  for  a^;  while  the 
lower  do  so  for  a2«  Three  regions  exist:  q  pierces  the  surface  regardless 


T 
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FIGURE  4 


Example  of  Switch  Surface  structure 


convex  hull  of  control  polytype 
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FIGURE  5 


Example  of  Sensor  Types 
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of  z  if  <  q3;  singular  arcs  rosy  exist  for  <j2  <  q^  <_  3q3*  and  q  crosses 
the  boundary  towards  the  (85)  halfspace  if  q^  >  .  Any  subset  of  these  three 

types  of  behavior  may  occur  in  a  boundary  wnich  fails  to  satisfy  corollary  9A. 

These  results  give  a  final  classification  for  sensors. 


Definition:  A  primary  sector  j  of  the  optimal  dynamics  is  one  in  which  all 
switch  surfaces  with  adjacent  sensors  are  closed,  for  all  t,  from  each  of 
them  into  j.  A  primary  sensor  is  one  activated  when  z(t)  lies  in  the 
corresponding  primary  sector.  All  sectors  and  sensors  which  are  not  primary 
are  secondary . 

Figure  5  shows  some  examples  of  primary,  secondary,  and  superfluous 
sensors  in  a  two  dimensional  case.  Points  representing  the  clarification 
vectors  of  each  sensor  are  connected  where  they  are  adjacent. 

The  distinction  which  primary  sensors  bear  is  given  by 


Theorem  10:  For  each  primary  sensor  jQ,  if 


<q(0) ,  a.  >  <max  <q(0) ,  a.> 
3o  j  3 


(94) 


then  sensor  j  will  never  appear  along  the  resulting  z(t),  q(t)  trajectory. 


Proof:  Since  z(0)  =0,  (85)  becomes 


<q{0) ,  n . .  (0)>  >  0 
3  jo 

when  (94)  holds.  Since,  for  sufficiently  small  6  . 


(95) 


<z(<5),  a_.  (6)  >  =  <5<q(0),  a.(0)>  (96) 

a  _  _> 

at  time  6  z  selects  some  j  ?  j  as  optimal  and  q(6)  z  q(0)  implies  q(<S) 

satisfies  (85)  strictly.  Since  the  switch  surface  from  each  j  adjacent 
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to  jQ  us  closed  from  j  to  j  ,  z(t)  will  never  move  into  the  sector  corresponding 

to  j  and  hence  j  will  not  be  selected.  ^ 

o  o 


Corollary  10A:  A  primary  sensor  j  may  be  activated  only  if  it  is  uniquely 
selected  at  t  =  0,  or  if  a  singular  arc  is  followed  which  lies  in  some  switch 
curve  on  the  boundary  of  sector  j  . 

Proof;  With  theorem  10  eliminating  j  from  consideration  if  not  selected  by 
q(0)  in  (94),  these  are  the  only  remaining  possibilities. 


A  more  powerful  result  can  also  be  obtained. 


Definition;  A  primary  face  of  the  control  polytope  is  a  set  of  mutually 
adjacent  primary  sensors. 

Corollary  10B;  If  a  schedule  involves  the  sensors  of  a  primary  face,  it  does 

so  by  commencing  with  a  singular  arc  mixing  them.  Until  the  terminal  condition 

is  met,  the  schedule  will  successively  drop  the  primary  sensors  from  the  mix, 

6 

and  invoke  a  sequence  of  secondary  sensors. 


proof:  By  corollary  10A,  if  any  of  the  sensors  on  the  face  are  to  be 

active,  they  must  be  included  in  a  singular  mix  at  t  =  0.  Since  they  are 

mutually  adjacent,  it  is  possible  (albeit  not  necessary)  that  they  all  appear 

in  the  mix.  Since  singular  arcs  are  confined  to  lie  on  switch  surfaces,  which 
2N 

are  not  dense  in  I?  ,  control  must  pass  to  exactly  one  primary  sensor  unless 


A  graphic  view  of  this  solution  in  three  dimensions  involves  imagining  the 
control  polytope  rolling  along  a  plane.  The  plane  is  one  orthogonal  to  z*lt) ; 
as  z(t)  evolves,  the  orientation  of  the  polytope  changes  with  respect  to  the 
plane.  The  plane  initially  includes  an  entire  (primary)  face  of  the  polytope; 
as  time  goes  on  the  polytope  will  roll  off  that  face  and  only  a  subset  of  the 
vertices  (an  edge)  will  intersect  the  plane.  Later,  one  of  these  will  leave 
and  only  the  last  primary  vertex  will  touch  the  plane;  successive  points  of 
contact  will  be  secondary  vertices.  Since  the  point^of  contact  with  the  plane 
is  indeed  the  maximal  projection  of  the  polytope  on  z(t),  this  is  optimal 
motion. 
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the  terminal  condition  happens  to  lie  on  a  switch  surface.  Since  the  other 
sensor  cannot  appear  intermittently  (although  the  values  of  the  mix  variables 
u(t)  may  vary),  they  must  drop  out  successively  until  one  is  left.  Since  no 
other  sensors  can  be  activated  until  z  leaves  the  final  primary  sector  and 
enters  a  secondary  sector,  nothing  can  be  added  to  the  mix  until  secondary 
sensors  appear.  By  corollary  10A,  no  more  primary  sensors  may  appear. 

O 


Conclusion;  This  section  has  addressed  the  optimal  control  problem 

formulated  in  section  III  from  the  original  scheduling  problem.  The  result 
is  a  two  point  boundary  value  problem,  which  in  reverse  time  is 


where 


and 


z(t) 

m 

q(t) 


=  q(t) 

-1  _M 

=  a  (u  (t) ) 


_>  j 

z (0)  =  0 


q  (t)  q(T) 


a(u  (t) )  =  E  a. (t)  u. (t) 


<z(t),  a(u  (t)>  >  <z(t),  a(u(t))> 


(97) 


(98) 


(99) 


for  any  ulc)  admissible.  In  addition,  structural  characteristics  of  sensors, 
which  may  be  easily  tested,  are  available  to  constrain  the  set  of  possible 
sensor  sequences  which  may  be  optimal .  In  fact ,  the  general  structure 
culminating  in  theorem  10  and  its  corollaries  provides  most  of  the  solution 
for  a  large  class  of  problems. 


V.  Computation  of  Optimal  Schedules 

Returning  from  the  optimal  control  problem  above  to  the  original  scheduling 
problem  involves  reinterpreting  the  previous  results  in  the  context  originally 
developed  in  sections  II  and  III. 
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Begin  by  returning  to  forward  time,  so  (97)  becomes 

z*(t)  -  -q(t)  z*(T)  -  0  (100) 

q(t)  =  -a(u(t) )  q (t)  q(0)  =  c  •  b 

as  the  two  point  boundary  value  problem.  (98,99)  are  static  conditions  and 
remain  unchanged.  The  solution  u(t)  specifies  the  fraction  of  effort  to 
be  devoted  to  each  sensor  at  time  t.  This  section  will  discuss  the  numerical 
solution  of  these  equations  using  structural  knowledge  obtained  in  section  IV; 
specialization  to  the  cases  of  primary  and  stationary  sensors  yields  more 
specific  techniques. 


numerical  solution;  Solution  of  the  equations  (100)  can  be  achieved  by  a 
number  of  techniques, such  as  an  iterative  strategy  which  refines  guesses  of 
the  unknown  boundary  conditions  at  each  end  of  the  time  interval  [18] . 

While  the  suitability  of  various  techniques  will  depend  heavily  on  the  dynamics 

-X 

of  the  a^  (t) ,  the  following  procedure  is  suggested  for  those  problems  where 
these  coefficients  vary  slowly. 


1.  Initial  guess:  z(t)  =  0,  0  <_  t  <  f 

2.  Integrate  the  equation  for  q  forward  in  time,  selecting 
u(t)  using  an  approximation  based  on  the  partitions  induced 
on  q  space  by  (83) 


u(t)  *  arg  max  <q(t),  a(u(t))> 
u 


(101) 


to  get  an  initial  guess  of  q(T),  denoted  q(0;T) 

3.  Integrate  (100)  forward  using  q(k;T)  as  terminal  condition  q(T). ; 


i.e.,  by  reversing  the  transformation  (51) 
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store  the  switch  schedule.  Obtain  z(k;0)  as  an  estimate 

-j 

of  the  initial  condition  on  z. 

_>  _» 

4.  Integrate  (100)  backwards  using  z(0)  *  z(k;0)  and  the 
schedule  obtained  in  (3) ;  obtain  q(k+l;T)  and  repeat 
3  and  4. 


The  advantages  of  this  technique  are  that  only  the  switch  times  (and 
mixes  over  singular  arcs)  need  be  stored  from  iteration  to  iteration,  rather 
than  either  the  entire  z  or  q  trajectories.  For  certain  classes  of  problems 
it  converges  in  one  step;  these  are  discussed  below.  No  other  general 
properties  of  this  solution  are  known  at  this  point. 

This  provides  a  technique  to  apply  to  complex  problems  with  little 
structure;  special  cases  with  strong  structure  can  lead  to  much  simpler 
solutions. 


Primary  Sensors;  The  corollaries  to  theorem  10  shed  light  on  the  special 
case  of  primary  sensors,  where  z(t)  is  forced  tc  lie  within  a  closed  primary 
sector  as  long  as  q(t)  satisfies  the  condition  (85)  for  each  sensor  pair 

j  _» 

defining  the  sector;  i.e.  q(t)  lies  in  a  convex  region  in  q  space.  If  a 

_»  _> 

pair  z(0),  q(0)  can  be  found  so  that  these  satisfy  all  constraints 


<  z  (0,  n  (0)  >  _>  0 


c  q (0) ,  >  +  <  z (0) ,  n  ^ (0) >  0 


(102) 


for  a  primary  sensor  j  and  all  adjacent  sensors  1,  we  can  conclude  that 
this  initial  condition  corresponds  to  the  terminus  of  generic  singular  arc 
as  described  in  corollary  10A  (in  reverse  time) .  In  particular,  a  non- 
negative  z (0)  satisfying  (102  5  without  equality,  for  the  known  q(0) 
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8 

guarantees  that  sensor  i  is  the  optimal  sensor  to  apply  to  time. 

Inequalities  (102-103)  define  2M'  linear  constraints  which  the 

4 

nonnegative  2(0)  must  satisfy,  where  M'  is  the  number  of  adjacent  sensors. 

If  a  solution  is  found  with  one  or  more  of  these  constraints  satisfied  with 
equality,  then  the  optimal  sensor  selection  at  time  0  is  mixed. 

This  special  case  immediately  suggests  using  an  open  loop  feed  back 
strategy,  where  at  each  point  in  time  the  initial  sensor  is  recomputed  using 
(102  -  103)  repeatedly.  If  singular  controls  should  arise,  they  may  be 
approximated  by  a  pure  choice  of  a  sensor;  receiving  an  observation  will 

_j  _ i 

update  the  conditional  probabilities,  hence  c,  and  hence  q  which  will  fall 
on  the  same  boundary  at  the  next  instant  of  time  with  negligible  probability. 
This  suggests  a  discrete  time  implementation,  with  the  sensor  choice  held 
fixed  over  the  sample  period  5  ,  and  6  satisfies  (45) . 

Stationary  Primary  Sensors:  If  a  particular  primary  sensor  and  it  adjacent 


sensors  are  stationary, 


(103) 


in  (102)  and  hence  z (0)  does  not  affect  the  satisfaction  of  (102)  at  all. 
Since  the  primary  sensor  j  is  not  superflous,  a  z(0)  can  always  be  found 
to  satisfy  (102);  q  satisfies  (103)  iff 


<q(0) ,  a  .  >  >  <  q(0) ,  aj.  > 


(104) 


Thus  if  a^  is  chosen  to  achieve  (104)  and  q(0)  lies  in  the  corresponding 
primary  sector  defined  by  the  stationary  version  of  (85) 


<  q(0)  ,  n  £>  _>  0 

then  sensor  j  is  optimal  to  apply  at  t  =  0. 


(105) 


The  question  of  uniqueness  can  not  be  addressed  here,  but  it  is  conjectured 
that  such  an  i  is  unique. 
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in  particular,  if  all  sensors  are  primary,  the  optimal  openloop 

feedback  law  can  be  computed  using  only  (104).  Also,  the  entire  schedule 

_» 

may  be  computed  in  these  cases  using  (104)  to  select  the  sensor  as  g  is 
integrated  forward  in  time  (and  this  corresponds  to  steps  1  and  2  of  the 
numerical  algorithm  given  above,  partially  justifying  it  as  a  suggested 
approach  to  solving  nearly  stationary  problems) . 

The  interpretation  of  this  rule  in  terms  of  the  control  polytope  is 
that  q ( 0 )  represents  the  need  for  information,  and  one  selects  the  sensor 

j 

which  best  matches  that  need  in  terms  of  its  projection  on  q(0). 

j 

Conclusion;  The  major  structural  component  which  helps  determine  u(0)  is 
that  of  primary  sensor.  By  exploiting  the  implications  of  a  schedule  which 
has  z  and  q  in  a  single  sector,  it  becomes  easy  to  determine  the  initial 
sensor  to  be  selected  whithout  finding  the  entire  schedule,  and  this  greatly 
encourages  on-line  implementations. 

VI.  Examples 

Three  practical  examples  illustrate  both  the  range  of  applications 
for  which  these  results  may  be  used,  and  the  use  of  the  various  pieces  of 
structure  developed  in  section  IV  in  solving  problems. 

Surveillance:  Consider  a  one-dimensional  surveillance  problem,  where  a 
single  physical  sensor  is  to  be  pointed  at  a  number  K  of  discrete  bins, 
each  of  which  may  or  may  not  contain  an  object.  The  signal  received  from 
each  bin  (e.g.  a  radar  reflection  or  an  acoustic  emission)  takes  on  a  fixed 
level  dependent  only  on  the  state  of  the  bin  and  i:  corrupted  by  Gaussian 
noise.  The  optimal  pointing  schedule  for  obtaining  a  clear  picture  of  the 
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contents  of  the  bins  is  to  be  determined. 

Each  of  the  basic  underlying  hypotheses  for  this  problem  specifies 
the  subset  of  bins  which  are  occupied  by  objects.  The  number  of  hypotheses 
grows  esponentially  with  K,  and  leads  to  cumbersome  problems  unless  the 
decomposition  properties  of  the  Cartesian  obscurity  function  (18)  are 
exploited.  Let 


it,  (t)  =  conditional  probability  that  bin  i  is  occupied 

10 

*rr i^(t)  55  conditional  probability  that  bin  i  is  unoccupied 


so 


Vi  <10  -  <iri0  irn) 


1/2 


(106) 


distinguish  between  the  states  of  bin  i. 


Pointing  the  sensor  at  bin  j  produces  a  Gaussian  random  variable 
2 

with  variance  0  and  mean 


m. 


Do 


"11 


if  j  occupied 
if  j  unoccupied 


so#  from  appendix  C,  the  clarification  coefficients  are 


ID 


(m , 

— JQ. 


C  2 

j 


i  t  j 

i  ■  3 


(107) 


It  is  immediate  that  no  sensor  is  superfluous,  and  that  all  sensors 

are  adjacent.  (The  control  polytope  has  vertices  only  on  each  coordinate 
1 


axis) .  Any  y  > 

-aDj 


satisfies 


y 


j  (aj  ‘  V  iaj  '  \ 


(108) 
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so  all  sensors  are  primary.  Since  this  is  a  stationary  problem,  it  is 
optimal  to  select  the  sensor  at  time  zero  which  maximizes 


<  q(0)  ,  a.>  =*  a.,  q  (0)  =  a-.,  b.  c. 

3  33  j  3D  3  3 


where  is  the  obscurity  initially  in  bin  j 


c.  -  Or.  (0)  u.. (0))1/2  (110) 

3  3°  31 

The  interpretation  of  this  role  is  to  first  look  at  the  bin  with 
highest  clarification,  importance,  and  initial  uncertainty.  When  all  bins 
are  equally  important  and  have  identical  priors,  this  rule  selects  first 
that  bin  about  which  greatest  clarification  can  be  obtained,  and  postpones 
looking  into  bins  with  lowest  signal  to  noise  ratio  until  last. 

Implementation  of  the  open  loop  feedback  law  is  quite  simple  in 
this  ease:  maintain  a  list  of  bins  ordered  by 

•jj  bj(irjo(t)  it  (t))l/2  (111) 

At  each  time  sample,  point  at  the  bin  at  the  head  of  the  list,  receive 
yi(t)»  update  and  Tt\ reinsert  bin  j  into  the  appropriate  spot  in 

the  list,  and  repeat. 


Search:  Suppose  there  are  K.  bins  as  above,  with  a  sensor  which  can  point 
at  one  at  a  time,  but  at  most  one  bin  contains  a  target.  Now  there  are  K+l 
hypotheses,  describing  the  location  of  the  object  as  well  as  Hq  representing 
the  absence  of  the  object. 

The  Cartesian  obscurity  function  can  no  longer  be  used  since  an 
observation  of  one  bin  provides  information  which  updates  all  probabilities 


i 
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An  alternative  is  the  uniform  obscurity  function  (13) . 


K-l  K 

Z  Z  (TTk  TT .  ) 

kl',°  k2=kl+1  1  2 


1/2 


(112) 


which  separates  all  hypotheses  equally. 

Using  the  same  sensor  model  as  above,  with 


m . 
30 


m 


m. 


o  jl  1 

for  simplicity  gives  clarification  coefficients 


2 
a . 
3 


(113) 


a (k  k  )  .  - 
12  3 


~  m0)‘ 


if  j  =*  k,  or 

q 

else 


(114) 


where  j  =  1,...,K.  Thus  pointing  the  sensor  at  bin  j  affects  all  terms 
involving  7r_.  by  a  factor  of 


(m,  -  m  ) 
l  o 


(115) 


and  all  others  not  at  all. 

Here  again,  no  sensor  is  superfluous  (since  a(0,j)j  =  a  uniquely 
for  control  j)  and  all  are  adjacent.  Also  a  y  >  —  ensures 

Y  (a(k1>k2)j-  *(k^k^n)  >.  ^(k^k^j  '  Nk^k^f.5 

(116) 

for  all  Jl  ,  so  every  switch  curve  is  closed  in  both  directions  and  all 
sensors  are  primary.  Again  (104)  gives  the  optimal  sensor;  it  maximizes 


K-l 

I 


K 

Z 


*(k, ,k„) j  (lTk, 
kl=0  k2-kl+1  12  12 


1/2 


(117) 
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or  using  (14) 

_  X  — 

a/ir .  Z  /rr  (118) 

3  k=0  k 
k/j 

Thus  the  sensor  is  pointed  at  the  bin  which  has  maximum  obscurity 
between  it  and  all  other  hypotheses.  Note  that  although  vn  defines  a 
~  k(k+l)  dimensional  space  in  which  the  clarification  coefficients  lie, 
and  there  are  only  k  sensors,  the  primary  sensor  structure  still  proves 
useful. 

Testing;  An  object  consists  of  two  parts,  each  of  which  can  be  normal  or 
flawed.  Radiation  can  be  directed  through  the  object  in  three  directions 
(figure  6)  and  the  attenuation  of  the  radiation  measured.  After  normaliza¬ 
tion  the  attenuation  coefficient  is  1  if  both  parts  are  normal, 4*^  if  the 
beam  passes  through  a  flawed  first  part,  and  <J>2  if  through  a  flawed  second 
part  (assume  ^2  <  ^  * 

While  the  uniform  obscurity  function  could  be  used  here,  its  six 
terms  can  be  reduced  to  four  using 

v (IT )  =  (fo  ir^1^2  *  (^0  lr2)1/2  +  (1T1  +  (1T2  1T3)1/2  (119) 

=  v,  (in  +  v  (in  +  v,  (in  +  v  (in 
12  3  4 

where 

=  probability  of  no  flaw 
rr  =  probability  1  is  flawed 
if  =  probability  2  is  flawed 
ir3  *  probability  both  are  flawed. 

Assuming  the  observations  are  corrupted  by  zero  mean,  unit  variance  Gaussian 


i 


-50- 


noise,  and  ignoring  scattering  and  other  effects,  the  clarification  coeffi¬ 
cients  for  this  problem  are  given  in  Table  1. 

Structurally,  no  sensors  are  superfluous;  corollary  9A  assures  that 
the  switch  surface  between  1  and  2  is  closed  bidirectionally,  and  the 
surface  from  1  to  3  and  from  2  to  3  are  closed.  However,  only  one  surface 
from  3  to  the  others  can  be  closed;  if  <J>^  >  <*>2,  the  boundary  from  3  to  1 
fails  the  test  (76-78) . 

For  a  specific  example,  let  ij>  =  1/2,  $2  =  1/4.  Then  (76-78) 

requires 


— 

1 

4 

0 

0 

9/16  * 

9/16 

>  + 

9/16 

9/64  ‘ 

-7/64 

-7/64 

1/64  * 

1/64 

_ 

1/64 

«■* 

for  the  surface  from  3  to  1  to  be  closed,  and  no  /  >  0  exists  for  this. 
However,  the  one  from  3  to  2  is  closed,  so  both  sensors  2  and  3  are  primary 
sensors.  If  q(0)  lies  in  either  of  these,  the  respective  sensor  is  optimal 
at  t  =  0;  if  not,  it  lies  in  the  secondary  sector  1.  Even  detailed, 

j  _J 

point-by-point  analysis  from  theorem  9  shows  q(t) ,  hence  z(t),  can  only 
cross  this  boundary  from  3  to  1  and  there  can  be  no  singular  arcs. 

Thus,  if  q(0)  lies  in  sector  1,  i.e. 


<q(0),  a1>  >  <q (0) ,  a2>, 


<q(0)  ,  a  >  >  <q  (0)  ,  a^ 


the  most  general  schedule  possible  is 

a)  use  1  alone  for  T  seconds,  then 

b)  use  3  alone  for  T2  seconds,  then 

c)  singular  arc  between  1  and  3  for  seconds. 


(120) 
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where  *  T  and  all  are  nonnegative.  The  mix  necessary  to 

sustain  (c)  can  be  found  easily;  solutions  for  1\  are  found  by  integrating 
the  z  and  q  equations  explicitly  and  using  the  terminal  condition  on  q, 
and  the  condition  that  z  must  cross  the  boundary  between  1  and  2  at 
exactly  time  . 

Thus  the  existence  of  secondary  sensors  makes  a  problem  more  complex; 
in  particular,  the  choice  of  sensor  1  or  3  in  this  case  will  depend  not 
only  on  q(0),  but  also  on  T  since  the  above  equations  are  nonlinear  in 

I 

the  T.  s. 
r 


VII.  Summary  and  Future  Directions 

This  paper  has  addressed  the  general  problem  of  selecting  a  sequence 
of  sensor  observations  to  take  in  order  to  acquire  information  to  test  the 
truth  of  each  of  several  hypotheses.  The  principal  contributions  are: 

a)  a  problem  formulation  with  a  cost  functional  possessing  both 
desirable  qualitative  properties  and  useful  analytic  structure; 

b)  the  evaluation  of  the  information  provided  by  a  sensor  in  terms 
of  a  vector  of  clarification  coefficients, 

c)  classification  of  sensors  into  superfluous,  primary,  and  secondary 
sensors; 

d)  structural  considerations  such  as  adjacency  and  closure  which  limit 
the  set  of  possible  sequences; 

e)  a  numerical  technique  for  finding  a  schedule  in  the  general  case,  and 


f)  special  conditions  under  which  the  first  sensor  in  the  schedule  may 


be  determined  easily. 
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However,  there  are  a  number  of  extensions  to  be  considered.  These 
include 

a)  Optimal  stopping:  by  including  a  "null"  sensor  with  clarification 
coefficients  all  zero,  and  introducing  additional  penalty  for  the  time 
non-null  sensors  are  used,  one  can  consider  terminating  the  sequence  when 
enough  information  is  collected  (see  appendix  G  for  a  binary  example 
related  to  [20] ) . 

b)  Terminal  cost:  the  penalty  function  here  assumes  that  only  obscurity 
over  the  interval  of  interest  is  to  be  considered;  an  additional  term 
penalizing  obscurity  left  at  time  T  would  fit  nicely  into  the  framework 

(and  result  in  a  boundary  condition  on  z  which  is  not  the  origin) . 

c)  Cost  linear  in  ir  :  Costs  of  this  form  are  also  eigenfunctions 
of  the  Bayes  update,  with  eigenvalue  1.  By  making  the  coefficients 

of  these  terms  dependent  on  the  sensor  choice,  however,  one  can  model 
the  fact  that  selection  of  a  particular  sensor  might  be  undesirable 
if  a  particular  hypothesis  were  true. 

d)  Sensor  dynamics:  The  dynamic  optimization  problem  resulting 
from  this  formulation  can  also  be  augmented  with  state  variables 
describing  sensor  dynamics  (e.g.  position  and  velocity  of  a  sensor 
platform) ,  with  controls  affecting  them.  This  would  allow  modelling  of 
dynamic  constraints  which  prohibit  instantaneous  switching  between  sensors. 

e)  Correlated  observation  processes:  This  formulation  requires 
the  observations  to  be  independent  increment  processes  when  conditioned 
on  the  hypotheses.  Relaxing  this  condition  to  allow  them  to  be,  say 
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noise  filtered  through  a  linear  system  where  the  system  parameters  depend 
on  the  hypothesis, would  extend  the  scope  of  applications.  However, 
sensors  would  be  selected  in  this  case  not  only  to  distinguish  among 
hypotheses,  but  also  to  acquire  good  state  estimates  to  aid  this  distinguish¬ 
ing. 

f)  Dynamic  hypotheses:  If  the  hypothesis  changes  over  time,  as  would 
the  state  of  a  Markov  chain,  one  could  model  a  number  of  dynamic  detection 
and  identification  problems.  Conceptually  this  can  be  placed  in  the 
current  framework  by  regarding  each  state  sequence  as  a  hypothesis  with 
time  varying  observation  statistics.  However,  the  state  space  structure 
should  be  exploitable  to  reduce  the  complexity. 

Many  other  opportunities  exist,  but  those  seem  to  be  both  important 
and  relatively  closely  related  to  the  results  here. 
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APPENDIX  A:  NOTATION 


Symbol  Quantity  Defined  (eg.) 


.a 

clarification  coefficient  for  term  i  by  sensor  j 

(40) 

a . 

3 

vector  of 

ij 

(56) 

_j  _>  _» 

a,  a (u  (t) ) 

convex  combination  of  aj  selected 

(50) 

“ij 

eigenvalue  of  mode  i  under  update  by  sensor  j 

(25) 

A 

matrix  of  all  a.  . 

(57) 

b. 

i 

_» 

incremental  cost  of  obscurity  of  form  in  term  i 

(8) 

b 

vector  of  b^s 

(61) 

c . 

i 

terminal  cost  coefficient  (v^(*)  at  t  «  0) 

(31) 

-X 

c 

vector  of  c. ' s 
i 

(61) 

c  (•) 

general  terminal  cost  function 

(E-l) 

d(-) 

general  running  cost  function 

(E-l) 

6 

sample  interval  betweer  .chedule  points 

(43) 

general  state  transit  function 

(E-2) 

T 

nonnegative  scale  factor 

(Thm.6) 

«k 

basic  hypothesis  k 

(1) 

A 

H 

maximum  a  posteriori  estimate  of  H 

(14) 

i 

index  over  terms  in  obscurity  function 

(8) 

j 

index  over  sensors  or  controls 

(24) 

K 

number  of  hypotheses 

(1) 

* 


1  D*  <1  O' 


*•  - 


k 

L 

l 

Xk 

M 

"k 


(k;T) 


a 


t 

t' 


-55- 


index  over  hypotheses  (1) 

number  of  subhypotheses  (19) 

index  over  subhypotheses  (15) 

Poisson  rate  associated  with  hypothesis  k  (B-2) 

number  of  sensors  (47) 

mean  of  Gaussian  distribution  associated  with  (107) 

hypothesis  k  (B-4) 

-i 

number  of  terms  in  v(tt)  (8) 

normal  to  switch  surface  between  sensors  i  and  j  (68) 

covariance  matrix  associated  with  hypothesis  k  (B-4) 

costate  variable  (E-8) 

costate  vector  (E-3) 

probability  of  detection  (D-2) 

probability  of  false  alarm  (D-2) 

conditional  probability  of  hypothesis  k  (1) 

conditional  distribution  on  hypotheses  (8) 

A 

probability  of  H  (14) 

attenuation  coefficient  of  hypothesis  k  (Table 

modified  costate  (55) (e 

modified  oostate  vector  (60) 

th  -» 

k  iteration  on  boundary  condition  on  q  (101) 

coefficient  in  convex  combination  (7)  (63) 

standard  deviation  (107) 

terminal  time  of  problem  (22) 

time  index  (continuous  and  discrete)  (2) 

reverse  time  variable  (51) 


i 


!«l 
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u . 
3 


uo 

u* 

v(ir) 


vi  (it) 
vC(tt) 


vu(tt) 


xi 


*Lo 


z 

rs 

z(k>0) 


fraction  of  time  spent  using  sensor  j  (47) 

input  (control)  vector  (50) 

point  of  symmetry  in  detector  (D-4) 

optimum  value  of  u  (54) 

obscurity  function  (8) 

ith  term  in  obscurity  function  (8) 

Cartesian  obscurity  function  (18) 

uniform  obscurity  function  (13) 

cost-to-go  function  (26} 

state  variable,  amplitude  of  mode  of  cost  function  (27) 

state  vector  (E-l) 

initial  condition  on  (E-2) 

random  output  obtained  when  sensor  j  selected  (2) 

alternate  state  variable  (55) (E-10) 

alternate  state  vector  (59) 

K  iteration  on  boundary  condition  for  z  (101) 


? 


-57- 


APPENDIX  B:  CLARIFICATION  COEFFICIENTS  FOR 
GAUSSIAN  AND  POISSON  PROCESSES 


This  appendix  contains  explicit  values  for  a _ (t)  for  observations 
which  are  Poisson  or  vector  Gaussian  processes.  Let  the  two  hypotheses  in 
the  obscurity  function  be  H,  and  H„;  a. .  is  defined  as 

1  2  i] 


a  (t) 
ij 


-In  I  [pCyl^)  p(y| 

»  — m 


H2)]1/2dy 


(B-l) 


where  this  is  interpreted  as  a  Steiljes  integral  if  the  functions  are  of 
denumerable  support. 


Poisson  process;  If  y (t)  is  the  number  of  events  in  the  tth  sample  interval 

resulting  from  a  Poisson  process  (such  as  photons  striking  a  detector)  which 

is  characterized  by  rate  .  (t)  under  each  hypothesis,  then 

K 


-x. 


(X  (t)n  e 

p(y(t)  -  nj H^)  =  - £7 - 


(B-2) 


(B-l)  becomes 


•«<«  ■  i  lAi  -  ^ 


(B-3) 


Gdussian  process:  If  y(t)  is  Gaussian  with  mean  M^(t)  end  covariance  |^(t) 
under  each  hypothesis,  direct  evaluation  of  (B-l)  ,  completing  the  square 
in  the  exponent,  and  using  a  well  known  matrix  equality  [19]  gives 


i  Lit)  i 

aij  ^  "  2  det  — 2 - '  "  J  det 


(B-4) 


.  (m.  (t)  -  m  (t))  ,  . 

jindstP.(t)  +  — - - - -  (P,(t)  +P_(t))  m2(t)> 

4—2  2  —2  - r - 


which  is  easily  interpreted  as  the  signal  to  noise  ratio  if  L  (t)  * 
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APPENDIX  C:  DIRECT  DERIVATION  OF  CLARIFICATION  COEFFICIENTS 


Clarification  coefficients  for  the  continuous  time  processes  of  Appendix  B 
can  be  derived  directly  without  recourse  to  the  definition  involving  an 
approximation  to  discrete  time  such  as  (51) .  It  is  restrictive  in  that 
the  independence  of  samples  over  time  (when  conditioned  on  the  hypothesis) (2) 
must  be  replaced  with  the  assumption  that  y(t)  is  an  independent  increments 
process  with 


p(y (t+5)  -  y  (t) )  (C-l) 

— 1 

known  for  all  t,  T  and  is  independent  of  y(t)  and  all  other  disjoint  intervals 
of  time. 

Poisson  processes:  Let  (C-l)  be  that  gf  a  Poisson  counting  process, 

so  -X  6 

(X  (t)  6)n  e  k 

p(y (t+5)  -  y (t)  «  n|ty  -  - ^ - 


where  6  is  small  compared  with  the  derivatives  of  X^ (t) .  The  eigenvalue 
(25)  associated  with  a  term  distinguishing  from  H2  is 

-4(AT(t)  -/X7(t) )  “  5 

a12(t)  -  e  z  (C-3) 

by  direct  computation.  As  5  +  0,  the  state  equation 


x(t+5)  -  a12(t)  x ( t)  +  b5 
can  be  written  as 


x(t+<5)  -  x(t) 

6 


a12(t)  -  1 


x(t)  +  b 


(C-4) 


(C-5) 


(C-6) 


x(t)  =  -  -  vTJ(t))2  x(t)  +  b 

or  indeed 

x(t)  =  -  [-  In  o  2(tJ]  x(t)  +  b  (C-7) 

=  -  a  2(t)  x(t)  +  b  (08) 


Gaussian  processes:  Here  the  distribution  (C-l)  for  a  small  interval  is 

_> 

Gaussian  with  mean  m(t)<5  and  covariance  P^(t)<5  .  The  eigenvalue  associated 
with  a  term  distinguishing  and  H2  for  samples  at  t  and  t+6  is 


*12m 


2 

det  ( 

a  ■ 


2  P1(t)6  +  P2(t)6^1 


1/4 


(09) 


mJL(t)  -  m2(t)  T 


m  (t)  -  m,  (t) 

)*  IPjtt)  +P2(t)]  (-± - j - ) 


which,  after  limiting  as  in  (C-4,  c-8)  gives  a^2(t)  as  (B-4) 


2 
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APPENDIX  D:  APPLICATION  TO  OPTIMAL  DETECTOR  DESIGN 


The  problem  of  detecting  a  known  signal  in  noise  has  been  well  studied 

[1] .  Essentially,  it  is  a  binary  hypothesis  testing  problem  -  assumes 

no  signal  present  and  H2  corresponds  to  the  existence  of  a  signal.  The 

detector  observes  the  signal,  if  any,  corrupted  by  noise  and  makes  a 

decision  as  to  whether  or  H2  is  the  actual  case. 

There  are  a  variety  of  criteria  for  judging  detector  performance: 

Bayes, Risk  Neyman-Pearson,  etc.  All  result  in  a  detector  which  computes 

the  posterior  probabilities  of  and  H2  given  the  observed  waveform,  then 

compares  their  ratio  to  a  fixed  threshold.  As  the  threshold  is  varied, 

the  probability  of  falso  alarm  p  (choosing  H  when  H  is  true)  and  the 

F  2  1 

probability  of  detection  pQ  (choosing  H2  when  is  true)  vary.  These 
criteria  give  formulae  for  setting  the  threshold  in  some  optimal  way. 

This  appendix  proposes  yet  a  third  way  to  select  the  threshold  as 
an  example  of  the  application  of  the  binary  hypothesis  testing  result 
of  section  IV.  Suppose  the  detector  is  used  for  data  compression  - 
reducing  its  entire  observed  waveform  to  a  binary  choice  between  and 
Hj.  This  choice  is  to  be  communicated  to  some  other  point  where  it  will 
be  used  as  a  basis  for  some  decisions  and  hence  minimum  obscurity  is  desired. 
The  result  of  section  IV  says  that  the  clarification  coefficient  of  the 

I 

detector  should  be  maximized;  here,  the  variable  which  can  be  selected 
is  the  threshold  in  the  detector. 

Recall  that  maximizing  the  clarification  coefficient  is  equivalent 
to  minimizing 

2  A  ^ 

E  (pChJh^  p(hJh2))1/2 


Ct>-1) 
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where  H  is  the  output  of  the  detector.  In  terms  of  and  p?  this  is 

<1  -  pF(u)  (1  -  pD(u)))1/2  +  (pptu)  pD(t))1/2  (D-2) 

where  PF(u)  and  pQ (u)  are  the  false  alarm  and  detection  probabilities  when 

u  is  used  as  the  threshold.  Assuming  differentiability,  a  necessary 
* 

condition  for  u  ,  the  optimum  threshold  is  that 

PD(1  -  Pp)  (1  -  PF))V2  -  (1  -  Pp)  (Pp  PF)V2 

PF  (1  -  pD)  (1  -  pf)51/2  -  (1  -  pf) (pD  pf)1/2 
u* 

Consider  u  ,  the  t  value  of  u  for  which  p_(u)  =  1  -  p_(u)  (which  exists 
o  u  r 

for  PD ( * )  and  pf ( • )  are  continuous . )  If  enough  symmetry  is  present  so 
that 

dpF 

as  — ■ 

du 

U  U  (D-4) 

o  o 

then  the  above  condition  is  satisfied.  For  the  case  of  a  signal  in 
additive  white  Gaussian  noise,  this  holds  and  yields,  incidentally,  the 
minimum  probability  of  error  rule. 

In  summary,  this  appendix  illustrates  how  the  techniques  developed  here 
can  be  applied  to  specification  of  sensor  parameters  as  well  as  sensor 
selection.  The  resulting  condition  for  setting  detector  thresholds  is 
interesting  in  that  it  arises  naturally  from  an  information  theoretic 
approach,  and  depends  only  on  internal  sensor  characteristics  without 
recourse  to  prior  probabilities  or  to  cost  parameters. 


u*  (D-3) 
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APPENDIX  E:  DERIVATIOIN  OF  NECESSARY  CONDITIONS 


Given  an  optimal  control  problem:  minimize 
c(x(t))  + 

with 

-  f^xCt),  u(t),t)  x^O)  -  xio  i»l,2 . ,N 

and  with  fixed  terminal  time  T  and  no  terminal  constraints  on  xt 
Pontryagin  principle  gives  the  following  necessary  conditions. 

The  Hamiltonian  function  is  defined  as 


r. 


d(x(t) ,  u(t))dt 


(E-l) 

(E-2) 


H(x,p",u,  t)  *  p^t)  fdT,u,t)  +  d£,u,t)  (E-3) 

wwhere  p(t)  is  an  N-dimensional  costate  vector.  The  necessary  conditions 

to  be  satisfied  are 


1.  Xj^Ct)  -  fi(x*(t),  p* (t)  u  (t),t) 

x*(°)  -  xiQ 

(E-4) 

2.  p*(t)  -  -  H(^*(t),  p*(t),  ^*(t),t) 

i  ox^ 

pJ(T>  "3^  C(^(T)) 

(E-5) 

3.  H(x*(t),  p(t),  u(t),t)5H(x*(t),p(t), 

^(t),t) 

for  all  u(t) 

(E-6) 

In  the  problem  (52,53)  these  become: 

N  _> 

H(x(t),  p(t),  u(t),t)  -  E  -p(t)  xt(t)  ai(u(t))  +  pt(t)  b± 

1.  xt*(t)  »  -at(u*(t))  x*(t)  +  bt 

x*(0)  *  0 

(E-7) 

2.  p^t)  -  ai(u(t)  p^t) 

P*(T)  -  Ci 

(E-8) 

N  *  *  _*  N  * 

3.  E  p  (t)  x  (t)  a  (u  (t))  ^  E  p  (t) 

i-1  1  1  1  i-1 

x*(t)  at  (u(t)) 

for  all  u(t) 

(E-9) 

The  Hamiltonian  condition  (E-9)  suggests  the  change  of  variable 
s^t)  “  P1(t)  Xj.Ct) 


(E-10) 
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which  satisfies  (using  (E-7,  E-8) ) 

zi(t)  =  bi  pi(t)  zi(0)  =  0  (E-ll) 

Define  also 

qi(t)  =  bi  pi(t)  (E-12) 

which  (E-8)  gives  as  satisfying 

^i (t)  ”  ai(u  (t))  qi(t)  qi(T)  *  bi  ci  (E-13) 

Finally,  the  Hamiltonian  condition  itself  becomes 

N  _»  N  j,  _> 

Z  z.  (t)  a.  (u  (t)  _>  Z  z.  (t)  a.(u(t))  for  all  u(t)  (E-14) 

i-1  11  i-1  1  x 

(E-ll,  E-13,  E-14)  are  the  conditions  which  appear  as  (54-56). 


Finally,  the  total  cost  is 


N  N 

Z  c.  X.  (T)  =  Z  p  (T)  x.  (T) 
i-l  i-1  1 

N 

=  Z  z . (T)  (E-15) 

i-1  1 


7 


(F-8) 
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The  condition  (F-2)  rewritten  as 


dt  n12<t)> 


—  <z2(t)  n12(t)> 


(F-9) 


ensures  that  (F  -  3)  holds  for  a  short  period  after  t. 


b)  <z1(x),  n12(T)>  >_  0  iff 


<&1(t)*  q  (t) ,  n12  (t)  >  +  2<q  (t) ,  (t)  >  +  <z '.t)  ,  n^2(t)>  >_  0  (F-10) 

Interpretation  Sensor  1  can  drive  z  into  the  sector  where  it  is  solely 

optimal . 

Proof;  As  above,  (F-2)  implies  that  at  time  t 


^  <z(t),  n12(t)>  =  0 


(F-ll) 


The  <zi(t),  n^2^>  increases  if  the  second  derivative  at  t  is 
positive;  (F-10)  ensures  this. 


c)  Similar  to  (b) ,  only  driving  z(t)  into  the  sector  where  2  is  solely 
optimal  requires  the  second  derivative  to  be  negative. 

d)  Singular  controls  exist  iff  both  condition  (F-lO)and  its  companion 

hold. 


Interpretation :  z(t)  can  be  maintained  on  the  switch  curve  iff  sensor 
1  alone  can  move  it  into  its  sector,  and  sensor  2  alone  can  move  it  off 
the  curve  into  the  sector  where  a2  is  optimal. 


Proof:  Singular  controls  require 


a*  ->  _» 

— r  <  z(t) ,  n  _ (t) > 
dt  1 


=  0 


(F-12) 


1  ‘ 
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for  some 


with 


a(t)  =  u^t)  a^tt)  +  u2(t)  a2(t) 


u^t),  u2(t)  >  0;  u^Ct)  +  u2(t)  =  1 
Expanding  (P-12)  and  substituting  (F-13),  it  is  necessary  that 


ux(t)  Y L (t)  -  u2(t)  Y 2 (t)  =  0 


where 


Y,(t)  =  <a;L(t)‘  q(t) ,  n12(t)>  +  2<q (t) ,  n12(t)>  +  <z (t),  n12  (t)> 


and 


-A 


-Y2  (t)  =  <a2(t)‘  q(t),  ^12(t)>  +■  2<q(t),  +  <z(t)'  n12(t)> 

(F- 15)  has  the  convex  sum  cf  two  scalars  equal  to  zero;  this  is  possible 
iff  one  is  nonnegative  and  the  other  nonpositive.  Since  (F-8)  implies 


Y i (t)  >  -Y2(t) 

Y^(t)  must  be  nonnegative  and  -Y2(t)  nonpositive. 

Computation  of  singular  controls ;  The  mixture  of  sensors  1  and  2  which 
maintain  singularity,  if  they  exist,  are  given  by 


u  (t) 


Y2(t) 


Yl(t)  +  Y2(t) 


Yx(t) 


Vt}  *  Yx (t)  +  Y2(t) 

Extention  to  several  controls:  With  M*  singular  controls,  (F-13)  is 


M 


Z  u.  a . (t) 

j-1  3  3 


(P-13) 

(F-14) 

(F-15) 

(f-16) 

(F-17) 

(F-18) 

(F-19) 

(F-  20) 


generalized  to 


a(t) 


(F-  21) 
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and  (F-14  )  to 


u  >  o 

3  - 


Pairwise  constraints  give  (F-15 )  as 


Y,(t) 

ui(t)  =  ^TTT  uj(t) 


M’ 

£ 

j=l 


u. 


3 


*1 


(F-22) 


(F-23) 


for  all  sensor  pairs(i,j)  in  the  set.  The  forms  (F-23)  give  at  most  M'-l 
independent  equations  in  u(t) ;  they  give  exactly  M'-l  such  constraints  when 
the  sensors  are  linearly  independent.  Combining  these  with  (F-22)  gives  the 
unique  control  mix  to  maintain  singularity  unless  the  sensors  are  linearly 
dependent;  in  this  case,  all  of  the  available  solutions  are  dynamically 
equivalent. 
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APPENDIX  G:  EXAMPLE  OF  OPTIMAL  STOPPING 

The  general  problem  formulation  of  this  paper  can  be  augmented  to 

allow  explicit  consideration  of  stopping  (or  becoming  inact_ve  for  a  period 

of  time  in  nonstationary  problems) .  The  objective  is  to  capture  the  notion 

that  the  it  ( t)  can  not  be  expected  to  reach  an  extremal  distribution  in  finite 

time.  However,  at  some  point  it  becomes  sufficiently  close  to  an  extremal 

distribution  that  further  effort  is  unjustified.  This  can  be  modelled  by 

augmenting  the  state  vector  with  a  "clock"  state  keeping  track  of  the  time 

a  sensor  is  in  use,  and  penalizing  its  final  value  .  The  clock 

is  turned  off  whenever  a  null  sensor  is  invoked,  one  which  provides  no 

information  on  any  term  in  the  obscurity  function. 

As  this  is  intended  as  a  simple  example,  it  will  consider  only  a 

a 

stationary  binary  hypothesis  testing  problem  (one  term  in  v(tt)).  The 
development  of  this  problem  is  identical  to  the  main  work  up  through 
section  III;  we  pick  up  from  there. 

Problem  statement;  One  of  M  sensors  may  be  selected  at  each  time;  sensor  j 
causes  the  amplitude  x^(t)  of  the  cost- to- go  function  to  vary  as 

x1(t)  =*  -a^  x^(t)  +  b  (G-l) 

A  penalty  on  the  obscurity  remaining  at  the  end  of  the  scheduling  interval 
of  the  form 

d(?r1(T)  VT))1/2  (G-2) 

would  appear  in  the  reverse-time  formulation  as  (see  Theorem  3) 

Xl(0)  =  d  (G-3) 

(It  is  necessary  to  penalize  the  residual  obscurity  with  ad»  b  or  the 


schedule  will  stop  at  t  -  0) 


m 
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To  include  the  time  penalty,  introduce 

x  -  1  x  (0)  *  0 

o  o 

and  the  overall  objective  is  then  to  minimize 

c  x  (T)  +  c.  x,  (T) 

O  o  11 

where  cq  specifies  the  weight  given  to  time  and  is  as  before, 
introduce  the  "wait"  sensor  for  which  the  dynamics  become 

x  =  0  x,  «*  0 

o  1 

as  neither  clarification  nor  penalty  is  accrued  when  it  is  used. 


Necessary  Conditions;  The  maximum  principle  yields  the  following 

conditions  for  this  problem: 

Case  I:  Sensor  j  f  0  is  optimal 

Hamiltonian:  -a^  x^(t)  p^  (t)  +  b  P1 (t)  +  PQ(t) 

States:  x  (t)  -1 
o 

x^t)  -  -a_j  x^t) 

* 

Costate3:  p  ■  0 
o 

Px(t)  -  p:(t) 


Case  II: 
Hamiltonian: 
States: 


Costates : 


Sensor  0  is  optimal 

0 

x  (t)  -  0 

O 

x^t)  =  0 
PQ(t)  -  0 
Pj.Ct)  -  0 


! 


(G-4) 

(G-5) 

Finally, 

(G-6) 

necessary 


i 


Hamiltonian:  Choose  j  to  achieve  the  minimum  value. 

o 

Boundary  conditions: 

x  (0)  -  0  p  (T)  »  o 

o  o  o 

x^O)  »  d  P1(T)  "  ei 

Interpretation:  The  Hamiltonian  condition  guarantees  that  if  a  non-null 
sensor  is  selected,  it  will  be  the  one  with  greatest  a  ^ ,  as  before. 
Dropping  all  others,  the  decision  for  sensors  is 


Define 


so 


o 

0  >  -a1  x1(t)  p^t)  +  b  p1(t)  +  PQ(t) 


21(t)  -  x^(t)  +  b  p^ft)  +  PQ(t) 

z, (t)  «  p  (t)  if  sensor  j  selected 

1  o 

■  0  else 


(G-7) 

(G-8) 

(G-9) 


However,  p  (t)  is  a  positive  constant,  c  ,  regardless  of  schedule.  Hence, 
o  o 

z^{t)  is  monotonically  increasing,  and  the  only  candidates  for  optimal 
schedules  can  switch  from  j  to  0  at  most  once,  and  never  from  0  to  j. 

Let  T^  be  the  total  time  the  sensor  j  is  on,  and  Tq  the  time  it  is 
off.  Then 


T  +  T  -  T  Z, (T, )  -  0  (G-10) 

o  1  11 

give  conditions  for  finding  T^  and  Tq.  Integrating  and  substituting  boundary 
conditions  gives 


W 


Cj.lt>  - 


a^d)e 


Vi 


+  c 


(G-ll) 


which  is  zero  when 
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Implementation:  This  rule  can  be  implemented  in  open  loop  feedback 

fashion  by  noting  that  sensor  j  is  used  at  zero  iff  the  solution  for  T 


in  (G-12)  is  positive,  i.e.  iff 
c! 

—  (a.d  -  b)  >  1 
Co  3 


(G-13) 


Since 


Ctt1(0)  tt2(0)) 


1/2  =  (p(l-p))1/2  <  j 


(G-14) 


it  follows  that  no  sensor  is  ever  used  if 


c  (a.d-b)  > 
°  1 


1/2 


and  j  is  used  only  when 


(G-15) 


(p-l/2)2<~-  — — - - j  (G-16) 

c  (a.d-b) 
o  l 

This  gives  a  decision  rule  which  is  fixed  over  time  but  shares  the 
structural  properties  of  other  stopping  strategies  [20]  in  that  data 
continues  to  be  collected  when  the  conditional  distribution  is  near  the 
center  of  the  [0,1]  interval,  and  ceases  as  it  moves  toward  the  boundaries. 
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